Classifying Topics of Video Lecture Contents Using Speech Recognition Technology

Authors

  • Jun Park
  • Jihie Kim
Abstract

We explore a speech-based topic classification approach. We generate a transcript of the input video lecture using speech recognition technology and identify the topic by comparing the transcript's term-based vector with topic models. Preliminary experimental results show that speech-based topic classification works well, with performance comparable to an approach that uses manual transcripts directly. The approach is also robust to speech recognition errors of up to 40.6%.
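The comparison step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the term weighting or the similarity measure, so raw term frequencies and cosine similarity are assumptions here, and the topic models, the sample transcript, and all function names are hypothetical.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Lowercase the text, tokenize on letters, and count term frequencies."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(transcript, topic_models):
    """Return the topic whose model vector is most similar to the transcript."""
    vec = term_vector(transcript)
    return max(topic_models, key=lambda topic: cosine(vec, topic_models[topic]))


# Hypothetical topic models; real ones would be built from training documents.
TOPIC_MODELS = {
    "physics": term_vector("force mass energy velocity acceleration momentum"),
    "biology": term_vector("cell protein gene organism enzyme membrane"),
}

# Hypothetical recognizer output containing an error ("mess" for "mass").
asr_transcript = "the force on the mess changes its velocity and momentum"
predicted_topic = classify(asr_transcript, TOPIC_MODELS)
```

In this toy case the misrecognized word removes one matching term, but the remaining topical terms still dominate the similarity score, which is one plausible reason a term-vector approach could tolerate recognition errors.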

Similar resources

Dynamic Browsing of Audiovisual Lecture Recordings Based on Automated Speech Recognition

The number of digital lecture video recordings has increased dramatically since recording technology became available. Access to and search within this large archive are limited and difficult. Manual annotation and segmentation are time-consuming and useless. A promising approach is based on using the audio layer of a lecture recording to get information about the lecture contents...

Subtopic segmentation in the lecture speech

This paper proposes a method that segments lecture video material into subtopics based on speech signals, for the creation of educational video contents. To represent the subtopics of video segments, the text recognized by automatic speech recognition (ASR) from lecture speech was converted into an index using independent component analysis (ICA) instead of conventional TF-IDF. This rese...

Lecture subtopic retrieval by retrieval keyword expansion using subordinate concept

We developed a support system for the creation of educational video contents. The system automatically segments lecture video material into subtopics based on speech signals, using a statistical model for text segmentation. In this paper, we report the results of retrieving lecture subtopics by keyword expansion using the knowledge of the dictionary, and so on. The keyword expansion using t...

Speech recognition performance of CJLC: corpus of Japanese lecture contents

This paper discusses the speech recognition of Japanese classroom lecture speech. In particular, we mention the influences of microphone differences and the language model differences on the speech recognition performance of classroom lectures. First, we collected actual classroom lecture contents from several universities in Japan. In this paper, we recorded the lecture speech using lapel micr...

Using video-taped examples of standardized patient to teach medical students taking informed consent

Introduction: Medical students should be trained in medical ethics, and one of the most essential issues in this field is taking informed consent. In this research, we compared the effectiveness of teaching methods on students’ ability to take informed consent from patients. Methods: This semi-experimental study was carried out on fifty-eight subjects from the 4th-year students of Shiraz U...

Publication date: 2012